Dataset statistics
| Number of variables | 46 |
|---|---|
| Number of observations | 7728394 |
| Missing cells | 12840498 |
| Missing cells (%) | 3.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.0 GiB |
| Average record size in memory | 277.0 B |
Variable types
| Text | 8 |
|---|---|
| Categorical | 10 |
| DateTime | 3 |
| Numeric | 12 |
| Boolean | 13 |
Country has constant value "US" | Constant |
Turning_Loop has constant value "False" | Constant |
Severity is highly imbalanced (55.4%) | Imbalance |
Amenity is highly imbalanced (90.3%) | Imbalance |
Bump is highly imbalanced (99.4%) | Imbalance |
Give_Way is highly imbalanced (95.7%) | Imbalance |
Junction is highly imbalanced (62.0%) | Imbalance |
No_Exit is highly imbalanced (97.5%) | Imbalance |
Railway is highly imbalanced (92.8%) | Imbalance |
Roundabout is highly imbalanced (99.9%) | Imbalance |
Station is highly imbalanced (82.5%) | Imbalance |
Stop is highly imbalanced (81.7%) | Imbalance |
Traffic_Calming is highly imbalanced (98.9%) | Imbalance |
End_Lat has 3402762 (44.0%) missing values | Missing |
End_Lng has 3402762 (44.0%) missing values | Missing |
Weather_Timestamp has 120228 (1.6%) missing values | Missing |
Temperature(F) has 163853 (2.1%) missing values | Missing |
Wind_Chill(F) has 1999019 (25.9%) missing values | Missing |
Humidity(%) has 174144 (2.3%) missing values | Missing |
Pressure(in) has 140679 (1.8%) missing values | Missing |
Visibility(mi) has 177098 (2.3%) missing values | Missing |
Wind_Direction has 175206 (2.3%) missing values | Missing |
Wind_Speed(mph) has 571233 (7.4%) missing values | Missing |
Precipitation(in) has 2203586 (28.5%) missing values | Missing |
Weather_Condition has 173459 (2.2%) missing values | Missing |
Distance(mi) is highly skewed (γ1 = 20.38575876) | Skewed |
Precipitation(in) is highly skewed (γ1 = 85.99914012) | Skewed |
ID has unique values | Unique |
Distance(mi) has 3302161 (42.7%) zeros | Zeros |
Wind_Speed(mph) has 961643 (12.4%) zeros | Zeros |
Precipitation(in) has 4991718 (64.6%) zeros | Zeros |
Reproduction
| Analysis started | 2024-06-18 19:05:25.798082 |
|---|---|
| Analysis finished | 2024-06-18 19:21:10.380993 |
| Duration | 15 minutes and 44.58 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
ID
Text
UNIQUE 
| Distinct | 7728394 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.8574952 |
| Min length | 3 |
Characters and Unicode
| Total characters | 68454213 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7728394 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | A-1 |
|---|---|
| 2nd row | A-2 |
| 3rd row | A-3 |
| 4th row | A-4 |
| 5th row | A-5 |
| Value | Count | Frequency (%) |
| a-1 | 1 | < 0.1% |
| a-19 | 1 | < 0.1% |
| a-7 | 1 | < 0.1% |
| a-8 | 1 | < 0.1% |
| a-9 | 1 | < 0.1% |
| a-10 | 1 | < 0.1% |
| a-11 | 1 | < 0.1% |
| a-12 | 1 | < 0.1% |
| a-13 | 1 | < 0.1% |
| a-14 | 1 | < 0.1% |
| Other values (7728384) | 7728384 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 7728394 | |
| - | 7728394 | |
| 1 | 5665623 | |
| 2 | 5665381 | |
| 4 | 5658416 | |
| 5 | 5650411 | |
| 3 | 5645102 | |
| 6 | 5639671 | |
| 7 | 5414701 | |
| 0 | 4553775 | |
| Other values (2) | 9104345 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 68454213 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 7728394 | |
| - | 7728394 | |
| 1 | 5665623 | |
| 2 | 5665381 | |
| 4 | 5658416 | |
| 5 | 5650411 | |
| 3 | 5645102 | |
| 6 | 5639671 | |
| 7 | 5414701 | |
| 0 | 4553775 | |
| Other values (2) | 9104345 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 68454213 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 7728394 | |
| - | 7728394 | |
| 1 | 5665623 | |
| 2 | 5665381 | |
| 4 | 5658416 | |
| 5 | 5650411 | |
| 3 | 5645102 | |
| 6 | 5639671 | |
| 7 | 5414701 | |
| 0 | 4553775 | |
| Other values (2) | 9104345 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 68454213 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 7728394 | |
| - | 7728394 | |
| 1 | 5665623 | |
| 2 | 5665381 | |
| 4 | 5658416 | |
| 5 | 5650411 | |
| 3 | 5645102 | |
| 6 | 5639671 | |
| 7 | 5414701 | |
| 0 | 4553775 | |
| Other values (2) | 9104345 |
Source
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| Source1 | |
|---|---|
| Source2 | |
| Source3 | 97389 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 54098758 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Source2 |
|---|---|
| 2nd row | Source2 |
| 3rd row | Source2 |
| 4th row | Source2 |
| 5th row | Source2 |
Common Values
| Value | Count | Frequency (%) |
| Source1 | 4325632 | |
| Source2 | 3305373 | |
| Source3 | 97389 | 1.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| source1 | 4325632 | |
| source2 | 3305373 | |
| source3 | 97389 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 7728394 | |
| o | 7728394 | |
| u | 7728394 | |
| r | 7728394 | |
| c | 7728394 | |
| e | 7728394 | |
| 1 | 4325632 | |
| 2 | 3305373 | |
| 3 | 97389 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54098758 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 7728394 | |
| o | 7728394 | |
| u | 7728394 | |
| r | 7728394 | |
| c | 7728394 | |
| e | 7728394 | |
| 1 | 4325632 | |
| 2 | 3305373 | |
| 3 | 97389 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54098758 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 7728394 | |
| o | 7728394 | |
| u | 7728394 | |
| r | 7728394 | |
| c | 7728394 | |
| e | 7728394 | |
| 1 | 4325632 | |
| 2 | 3305373 | |
| 3 | 97389 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54098758 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 7728394 | |
| o | 7728394 | |
| u | 7728394 | |
| r | 7728394 | |
| c | 7728394 | |
| e | 7728394 | |
| 1 | 4325632 | |
| 2 | 3305373 | |
| 3 | 97389 | 0.2% |
Severity
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| 2 | |
|---|---|
| 3 | |
| 4 | 204710 |
| 1 | 67366 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 7728394 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7728394 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7728394 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7728394 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 6156981 | |
| 3 | 1299337 | 16.8% |
| 4 | 204710 | 2.6% |
| 1 | 67366 | 0.9% |
Start_Time
Date
| Distinct | 5801064 |
|---|---|
| Distinct (%) | 75.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| Minimum | 2016-01-14 20:18:33 |
|---|---|
| Maximum | 2023-03-31 23:30:00 |
End_Time
Date
| Distinct | 6463024 |
|---|---|
| Distinct (%) | 83.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| Minimum | 2016-02-08 06:37:08 |
|---|---|
| Maximum | 2023-03-31 23:59:00 |
Start_Lat
Real number (ℝ)
| Distinct | 2428358 |
|---|---|
| Distinct (%) | 31.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.201195 |
| Minimum | 24.5548 |
|---|---|
| Maximum | 49.002201 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 24.5548 |
|---|---|
| 5-th percentile | 27.111647 |
| Q1 | 33.399631 |
| median | 35.823974 |
| Q3 | 40.084959 |
| 95-th percentile | 44.858648 |
| Maximum | 49.002201 |
| Range | 24.447401 |
| Interquartile range (IQR) | 6.6853275 |
Descriptive statistics
| Standard deviation | 5.0760791 |
|---|---|
| Coefficient of variation (CV) | 0.14021855 |
| Kurtosis | -0.53205376 |
| Mean | 36.201195 |
| Median Absolute Deviation (MAD) | 3.3899255 |
| Skewness | -0.072220725 |
| Sum | 2.7977709 × 108 |
| Variance | 25.766579 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 37.808498 | 570 | < 0.1% |
| 33.941364 | 568 | < 0.1% |
| 34.858849 | 545 | < 0.1% |
| 42.476501 | 534 | < 0.1% |
| 33.744976 | 533 | < 0.1% |
| 34.858925 | 495 | < 0.1% |
| 40.847923 | 473 | < 0.1% |
| 34.039394 | 462 | < 0.1% |
| 33.876289 | 458 | < 0.1% |
| 25.789072 | 441 | < 0.1% |
| Other values (2428348) | 7723315 |
| Value | Count | Frequency (%) |
| 24.5548 | 1 | |
| 24.555269 | 1 | |
| 24.5574 | 1 | |
| 24.559731 | 1 | |
| 24.55987 | 1 | |
| 24.560246 | 1 | |
| 24.560688 | 1 | |
| 24.562117 | 1 | |
| 24.563089 | 1 | |
| 24.566027 | 1 |
| Value | Count | Frequency (%) |
| 49.002201 | 1 | |
| 49.000759 | 1 | |
| 49.00058 | 1 | |
| 49.00056 | 1 | |
| 49.000504 | 2 | |
| 49.00049329 | 1 | |
| 49.000269 | 1 | |
| 49.00026 | 1 | |
| 48.999901 | 1 | |
| 48.999569 | 1 |
Start_Lng
Real number (ℝ)
| Distinct | 2482533 |
|---|---|
| Distinct (%) | 32.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -94.702545 |
| Minimum | -124.62383 |
|---|---|
| Maximum | -67.113167 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 7728394 |
| Negative (%) | 100.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | -124.62383 |
|---|---|
| 5-th percentile | -122.19164 |
| Q1 | -117.2194 |
| median | -87.766616 |
| Q3 | -80.353676 |
| 95-th percentile | -73.956087 |
| Maximum | -67.113167 |
| Range | 57.510666 |
| Interquartile range (IQR) | 36.86572 |
Descriptive statistics
| Standard deviation | 17.391756 |
|---|---|
| Coefficient of variation (CV) | -0.18364613 |
| Kurtosis | -1.3631024 |
| Mean | -94.702545 |
| Median Absolute Deviation (MAD) | 9.968301 |
| Skewness | -0.48291963 |
| Sum | -7.3189858 × 108 |
| Variance | 302.47319 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -122.366852 | 578 | < 0.1% |
| -118.096634 | 562 | < 0.1% |
| -82.260422 | 545 | < 0.1% |
| -84.390343 | 534 | < 0.1% |
| -83.111794 | 534 | < 0.1% |
| -73.942825 | 514 | < 0.1% |
| -80.165855 | 513 | < 0.1% |
| -82.259857 | 497 | < 0.1% |
| -118.368263 | 495 | < 0.1% |
| -80.210114 | 476 | < 0.1% |
| Other values (2482523) | 7723146 |
| Value | Count | Frequency (%) |
| -124.623833 | 1 | < 0.1% |
| -124.548074 | 2 | |
| -124.541015 | 1 | < 0.1% |
| -124.539056 | 1 | < 0.1% |
| -124.535893 | 1 | < 0.1% |
| -124.535726 | 1 | < 0.1% |
| -124.534439 | 1 | < 0.1% |
| -124.531602 | 4 | |
| -124.512297 | 1 | < 0.1% |
| -124.511949 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -67.113167 | 1 | |
| -67.403551 | 1 | |
| -67.48413 | 1 | |
| -67.553307 | 1 | |
| -67.606864 | 1 | |
| -67.606875 | 1 | |
| -67.614387 | 1 | |
| -67.626576 | 1 | |
| -67.70337 | 1 | |
| -67.709053 | 1 |
End_Lat
Real number (ℝ)
MISSING 
| Distinct | 1568172 |
|---|---|
| Distinct (%) | 36.3% |
| Missing | 3402762 |
| Missing (%) | 44.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 36.261829 |
| Minimum | 24.566013 |
|---|---|
| Maximum | 49.075 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 24.566013 |
|---|---|
| 5-th percentile | 26.029643 |
| Q1 | 33.46207 |
| median | 36.183495 |
| Q3 | 40.17892 |
| 95-th percentile | 44.981551 |
| Maximum | 49.075 |
| Range | 24.508987 |
| Interquartile range (IQR) | 6.7168502 |
Descriptive statistics
| Standard deviation | 5.2729045 |
|---|---|
| Coefficient of variation (CV) | 0.14541199 |
| Kurtosis | -0.55709151 |
| Mean | 36.261829 |
| Median Absolute Deviation (MAD) | 3.4407145 |
| Skewness | -0.15821723 |
| Sum | 1.5685533 × 108 |
| Variance | 27.803522 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28.450015 | 1039 | < 0.1% |
| 25.701774 | 860 | < 0.1% |
| 25.684322 | 836 | < 0.1% |
| 28.449928 | 794 | < 0.1% |
| 25.686252 | 720 | < 0.1% |
| 25.924771 | 712 | < 0.1% |
| 25.889378 | 712 | < 0.1% |
| 25.73316 | 680 | < 0.1% |
| 28.45019 | 656 | < 0.1% |
| 28.42136 | 654 | < 0.1% |
| Other values (1568162) | 4317969 | |
| (Missing) | 3402762 |
| Value | Count | Frequency (%) |
| 24.566013 | 1 | < 0.1% |
| 24.569978 | 3 | |
| 24.570107 | 4 | |
| 24.57011 | 1 | < 0.1% |
| 24.57018 | 1 | < 0.1% |
| 24.57029 | 1 | < 0.1% |
| 24.57036 | 1 | < 0.1% |
| 24.570461 | 1 | < 0.1% |
| 24.57124 | 1 | < 0.1% |
| 24.57126 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 49.075 | 1 | < 0.1% |
| 49.00222329 | 1 | < 0.1% |
| 49.00214 | 1 | < 0.1% |
| 49.002025 | 2 | |
| 49.000769 | 2 | |
| 49.00076 | 3 | |
| 49.000641 | 1 | < 0.1% |
| 49.00056 | 1 | < 0.1% |
| 49.000025 | 1 | < 0.1% |
| 48.999966 | 2 |
End_Lng
Real number (ℝ)
MISSING 
| Distinct | 1605789 |
|---|---|
| Distinct (%) | 37.1% |
| Missing | 3402762 |
| Missing (%) | 44.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -95.72557 |
| Minimum | -124.54575 |
|---|---|
| Maximum | -67.109242 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 4325632 |
| Negative (%) | 56.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | -124.54575 |
|---|---|
| 5-th percentile | -122.26437 |
| Q1 | -117.75434 |
| median | -88.02789 |
| Q3 | -80.247086 |
| 95-th percentile | -74.023676 |
| Maximum | -67.109242 |
| Range | 57.436506 |
| Interquartile range (IQR) | 37.507258 |
Descriptive statistics
| Standard deviation | 18.107928 |
|---|---|
| Coefficient of variation (CV) | -0.189165 |
| Kurtosis | -1.5556514 |
| Mean | -95.72557 |
| Median Absolute Deviation (MAD) | 10.95307 |
| Skewness | -0.36549222 |
| Sum | -4.1407359 × 108 |
| Variance | 327.89704 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -81.471375 | 1037 | < 0.1% |
| -80.334179 | 860 | < 0.1% |
| -80.416621 | 836 | < 0.1% |
| -81.477219 | 794 | < 0.1% |
| -80.416521 | 721 | < 0.1% |
| -80.293318 | 712 | < 0.1% |
| -80.336612 | 679 | < 0.1% |
| -81.399777 | 658 | < 0.1% |
| -78.680237 | 645 | < 0.1% |
| -81.47766 | 637 | < 0.1% |
| Other values (1605779) | 4318053 | |
| (Missing) | 3402762 |
| Value | Count | Frequency (%) |
| -124.545748 | 2 | |
| -124.544508 | 1 | < 0.1% |
| -124.543727 | 1 | < 0.1% |
| -124.539056 | 1 | < 0.1% |
| -124.535893 | 1 | < 0.1% |
| -124.535726 | 3 | |
| -124.531602 | 1 | < 0.1% |
| -124.512297 | 1 | < 0.1% |
| -124.509263 | 1 | < 0.1% |
| -124.497829 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -67.109242 | 1 | |
| -67.40355 | 1 | |
| -67.48413 | 1 | |
| -67.606864 | 1 | |
| -67.62034 | 1 | |
| -67.626576 | 1 | |
| -67.626605 | 1 | |
| -67.706448 | 1 | |
| -67.739817 | 1 | |
| -67.78734 | 1 |
Distance(mi)
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 22382 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.56184228 |
| Minimum | 0 |
|---|---|
| Maximum | 441.75 |
| Zeros | 3302161 |
| Zeros (%) | 42.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.03 |
| Q3 | 0.464 |
| 95-th percentile | 2.67 |
| Maximum | 441.75 |
| Range | 441.75 |
| Interquartile range (IQR) | 0.464 |
Descriptive statistics
| Standard deviation | 1.7768106 |
|---|---|
| Coefficient of variation (CV) | 3.1624722 |
| Kurtosis | 1649.5954 |
| Mean | 0.56184228 |
| Median Absolute Deviation (MAD) | 0.03 |
| Skewness | 20.385759 |
| Sum | 4342138.5 |
| Variance | 3.1570559 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3302161 | |
| 0.01 | 262493 | 3.4% |
| 0.008 | 14558 | 0.2% |
| 0.009 | 13836 | 0.2% |
| 0.009999999776 | 13367 | 0.2% |
| 0.007 | 12413 | 0.2% |
| 0.011 | 11625 | 0.2% |
| 0.03 | 11322 | 0.1% |
| 0.024 | 11002 | 0.1% |
| 0.028 | 10927 | 0.1% |
| Other values (22372) | 4064690 |
| Value | Count | Frequency (%) |
| 0 | 3302161 | |
| 0.001 | 5585 | 0.1% |
| 0.002 | 3078 | < 0.1% |
| 0.003 | 4263 | 0.1% |
| 0.004 | 6337 | 0.1% |
| 0.005 | 8253 | 0.1% |
| 0.006 | 10121 | 0.1% |
| 0.007 | 12413 | 0.2% |
| 0.008 | 14558 | 0.2% |
| 0.009 | 13836 | 0.2% |
| Value | Count | Frequency (%) |
| 441.75 | 1 | |
| 336.5700073 | 1 | |
| 333.6300049 | 1 | |
| 254.3999939 | 1 | |
| 251.2200012 | 1 | |
| 242.3399963 | 1 | |
| 227.2100067 | 1 | |
| 224.5899963 | 1 | |
| 210.0800018 | 1 | |
| 194.7299957 | 1 |
Description
Text
| Distinct | 3761578 |
|---|---|
| Distinct (%) | 48.7% |
| Missing | 5 |
| Missing (%) | < 0.1% |
| Memory size | 59.0 MiB |
Length
| Max length | 679 |
|---|---|
| Median length | 456 |
| Mean length | 68.837155 |
| Min length | 2 |
Characters and Unicode
| Total characters | 532000315 |
|---|---|
| Distinct characters | 105 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2704451 ? |
|---|---|
| Unique (%) | 35.0% |
Sample
| 1st row | Right lane blocked due to accident on I-70 Eastbound at Exit 41 OH-235 State Route 4. |
|---|---|
| 2nd row | Accident on Brice Rd at Tussing Rd. Expect delays. |
| 3rd row | Accident on OH-32 State Route 32 Westbound at Dela Palma Rd. Expect delays. |
| 4th row | Accident on I-75 Southbound at Exits 52 52B US-35. Expect delays. |
| 5th row | Accident on McEwen Rd at OH-725 Miamisburg Centerville Rd. Expect delays. |
| Value | Count | Frequency (%) |
| on | 6287667 | 6.6% |
| accident | 5703508 | 6.0% |
| to | 4570428 | 4.8% |
| at | 3662924 | 3.9% |
| due | 2871138 | 3.0% |
| rd | 2602371 | 2.7% |
| 2186578 | 2.3% | |
| near | 1721796 | 1.8% |
| blocked | 1720312 | 1.8% |
| from | 1607181 | 1.7% |
| Other values (225147) | 61729359 |
Most occurring characters
| Value | Count | Frequency (%) |
| 86934796 | ||
| t | 34426190 | 6.5% |
| e | 31991488 | 6.0% |
| n | 30314954 | 5.7% |
| o | 28376352 | 5.3% |
| a | 23052293 | 4.3% |
| d | 22676534 | 4.3% |
| i | 21063299 | 4.0% |
| c | 19870368 | 3.7% |
| r | 16094461 | 3.0% |
| Other values (95) | 217199580 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 532000315 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 86934796 | ||
| t | 34426190 | 6.5% |
| e | 31991488 | 6.0% |
| n | 30314954 | 5.7% |
| o | 28376352 | 5.3% |
| a | 23052293 | 4.3% |
| d | 22676534 | 4.3% |
| i | 21063299 | 4.0% |
| c | 19870368 | 3.7% |
| r | 16094461 | 3.0% |
| Other values (95) | 217199580 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 532000315 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 86934796 | ||
| t | 34426190 | 6.5% |
| e | 31991488 | 6.0% |
| n | 30314954 | 5.7% |
| o | 28376352 | 5.3% |
| a | 23052293 | 4.3% |
| d | 22676534 | 4.3% |
| i | 21063299 | 4.0% |
| c | 19870368 | 3.7% |
| r | 16094461 | 3.0% |
| Other values (95) | 217199580 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 532000315 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 86934796 | ||
| t | 34426190 | 6.5% |
| e | 31991488 | 6.0% |
| n | 30314954 | 5.7% |
| o | 28376352 | 5.3% |
| a | 23052293 | 4.3% |
| d | 22676534 | 4.3% |
| i | 21063299 | 4.0% |
| c | 19870368 | 3.7% |
| r | 16094461 | 3.0% |
| Other values (95) | 217199580 |
Street
Text
| Distinct | 336306 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 10869 |
| Missing (%) | 0.1% |
| Memory size | 59.0 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 47 |
| Mean length | 11.062818 |
| Min length | 1 |
Characters and Unicode
| Total characters | 85377573 |
|---|---|
| Distinct characters | 80 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 129934 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | I-70 E |
|---|---|
| 2nd row | Brice Rd |
| 3rd row | State Route 32 |
| 4th row | I-75 S |
| 5th row | Miamisburg Centerville Rd |
| Value | Count | Frequency (%) |
| n | 1199552 | 6.3% |
| s | 1194304 | 6.2% |
| rd | 1159617 | 6.0% |
| w | 941905 | 4.9% |
| e | 931753 | 4.9% |
| st | 684198 | 3.6% |
| ave | 640512 | 3.3% |
| blvd | 343950 | 1.8% |
| fwy | 330007 | 1.7% |
| dr | 327207 | 1.7% |
| Other values (74656) | 11416775 |
Most occurring characters
| Value | Count | Frequency (%) |
| 13148775 | 15.4% | |
| e | 4752092 | 5.6% |
| a | 3818802 | 4.5% |
| r | 3231482 | 3.8% |
| t | 3213194 | 3.8% |
| o | 2990987 | 3.5% |
| S | 2982408 | 3.5% |
| n | 2968805 | 3.5% |
| d | 2778142 | 3.3% |
| l | 2764944 | 3.2% |
| Other values (70) | 42727942 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 85377573 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 13148775 | 15.4% | |
| e | 4752092 | 5.6% |
| a | 3818802 | 4.5% |
| r | 3231482 | 3.8% |
| t | 3213194 | 3.8% |
| o | 2990987 | 3.5% |
| S | 2982408 | 3.5% |
| n | 2968805 | 3.5% |
| d | 2778142 | 3.3% |
| l | 2764944 | 3.2% |
| Other values (70) | 42727942 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 85377573 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 13148775 | 15.4% | |
| e | 4752092 | 5.6% |
| a | 3818802 | 4.5% |
| r | 3231482 | 3.8% |
| t | 3213194 | 3.8% |
| o | 2990987 | 3.5% |
| S | 2982408 | 3.5% |
| n | 2968805 | 3.5% |
| d | 2778142 | 3.3% |
| l | 2764944 | 3.2% |
| Other values (70) | 42727942 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 85377573 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 13148775 | 15.4% | |
| e | 4752092 | 5.6% |
| a | 3818802 | 4.5% |
| r | 3231482 | 3.8% |
| t | 3213194 | 3.8% |
| o | 2990987 | 3.5% |
| S | 2982408 | 3.5% |
| n | 2968805 | 3.5% |
| d | 2778142 | 3.3% |
| l | 2764944 | 3.2% |
| Other values (70) | 42727942 |
City
Text
| Distinct | 13678 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 253 |
| Missing (%) | < 0.1% |
| Memory size | 59.0 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 8.7755685 |
| Min length | 3 |
Characters and Unicode
| Total characters | 67818831 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1023 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Dayton |
|---|---|
| 2nd row | Reynoldsburg |
| 3rd row | Williamsburg |
| 4th row | Dayton |
| 5th row | Dayton |
| Value | Count | Frequency (%) |
| san | 222025 | 2.2% |
| miami | 207501 | 2.1% |
| city | 196884 | 2.0% |
| houston | 169689 | 1.7% |
| los | 169053 | 1.7% |
| angeles | 156702 | 1.6% |
| charlotte | 139395 | 1.4% |
| dallas | 131585 | 1.3% |
| orlando | 109733 | 1.1% |
| beach | 98038 | 1.0% |
| Other values (10808) | 8345041 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6545547 | 9.7% |
| e | 5932168 | 8.7% |
| o | 5228124 | 7.7% |
| n | 5137103 | 7.6% |
| l | 4621065 | 6.8% |
| i | 4263043 | 6.3% |
| r | 3924811 | 5.8% |
| t | 3811793 | 5.6% |
| s | 3182527 | 4.7% |
| 2217505 | 3.3% | |
| Other values (56) | 22955145 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 67818831 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 6545547 | 9.7% |
| e | 5932168 | 8.7% |
| o | 5228124 | 7.7% |
| n | 5137103 | 7.6% |
| l | 4621065 | 6.8% |
| i | 4263043 | 6.3% |
| r | 3924811 | 5.8% |
| t | 3811793 | 5.6% |
| s | 3182527 | 4.7% |
| 2217505 | 3.3% | |
| Other values (56) | 22955145 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 67818831 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 6545547 | 9.7% |
| e | 5932168 | 8.7% |
| o | 5228124 | 7.7% |
| n | 5137103 | 7.6% |
| l | 4621065 | 6.8% |
| i | 4263043 | 6.3% |
| r | 3924811 | 5.8% |
| t | 3811793 | 5.6% |
| s | 3182527 | 4.7% |
| 2217505 | 3.3% | |
| Other values (56) | 22955145 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 67818831 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 6545547 | 9.7% |
| e | 5932168 | 8.7% |
| o | 5228124 | 7.7% |
| n | 5137103 | 7.6% |
| l | 4621065 | 6.8% |
| i | 4263043 | 6.3% |
| r | 3924811 | 5.8% |
| t | 3811793 | 5.6% |
| s | 3182527 | 4.7% |
| 2217505 | 3.3% | |
| Other values (56) | 22955145 |
County
Text
| Distinct | 1871 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 23 |
| Mean length | 8.0643539 |
| Min length | 3 |
Characters and Unicode
| Total characters | 62324504 |
|---|---|
| Distinct characters | 59 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 52 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Montgomery |
|---|---|
| 2nd row | Franklin |
| 3rd row | Clermont |
| 4th row | Montgomery |
| 5th row | Montgomery |
| Value | Count | Frequency (%) |
| los | 526853 | 5.6% |
| angeles | 526851 | 5.6% |
| san | 311726 | 3.3% |
| miami-dade | 251601 | 2.7% |
| orange | 241275 | 2.6% |
| harris | 181196 | 1.9% |
| dallas | 157024 | 1.7% |
| mecklenburg | 147265 | 1.6% |
| montgomery | 136788 | 1.5% |
| wake | 117890 | 1.3% |
| Other values (1772) | 6803587 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6748757 | 10.8% |
| e | 6490587 | 10.4% |
| n | 4953741 | 7.9% |
| o | 4391707 | 7.0% |
| r | 4005541 | 6.4% |
| s | 3454789 | 5.5% |
| i | 3316153 | 5.3% |
| l | 3243086 | 5.2% |
| t | 2162391 | 3.5% |
| g | 1805031 | 2.9% |
| Other values (49) | 21752721 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 62324504 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 6748757 | 10.8% |
| e | 6490587 | 10.4% |
| n | 4953741 | 7.9% |
| o | 4391707 | 7.0% |
| r | 4005541 | 6.4% |
| s | 3454789 | 5.5% |
| i | 3316153 | 5.3% |
| l | 3243086 | 5.2% |
| t | 2162391 | 3.5% |
| g | 1805031 | 2.9% |
| Other values (49) | 21752721 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 62324504 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 6748757 | 10.8% |
| e | 6490587 | 10.4% |
| n | 4953741 | 7.9% |
| o | 4391707 | 7.0% |
| r | 4005541 | 6.4% |
| s | 3454789 | 5.5% |
| i | 3316153 | 5.3% |
| l | 3243086 | 5.2% |
| t | 2162391 | 3.5% |
| g | 1805031 | 2.9% |
| Other values (49) | 21752721 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 62324504 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 6748757 | 10.8% |
| e | 6490587 | 10.4% |
| n | 4953741 | 7.9% |
| o | 4391707 | 7.0% |
| r | 4005541 | 6.4% |
| s | 3454789 | 5.5% |
| i | 3316153 | 5.3% |
| l | 3243086 | 5.2% |
| t | 2162391 | 3.5% |
| g | 1805031 | 2.9% |
| Other values (49) | 21752721 |
State
Categorical
| Distinct | 49 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| CA | |
|---|---|
| FL | |
| TX | |
| SC | |
| NY | 347960 |
| Other values (44) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15456788 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | OH |
|---|---|
| 2nd row | OH |
| 3rd row | OH |
| 4th row | OH |
| 5th row | OH |
Common Values
| Value | Count | Frequency (%) |
| CA | 1741433 | |
| FL | 880192 | 11.4% |
| TX | 582837 | 7.5% |
| SC | 382557 | 5.0% |
| NY | 347960 | 4.5% |
| NC | 338199 | 4.4% |
| VA | 303301 | 3.9% |
| PA | 296620 | 3.8% |
| MN | 192084 | 2.5% |
| OR | 179660 | 2.3% |
| Other values (39) | 2483551 |
Length
| Value | Count | Frequency (%) |
| ca | 1741433 | |
| fl | 880192 | 11.4% |
| tx | 582837 | 7.5% |
| sc | 382557 | 5.0% |
| ny | 347960 | 4.5% |
| nc | 338199 | 4.4% |
| va | 303301 | 3.9% |
| pa | 296620 | 3.8% |
| mn | 192084 | 2.5% |
| or | 179660 | 2.3% |
| Other values (39) | 2483551 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3151246 | |
| C | 2642709 | |
| N | 1328134 | |
| L | 1299895 | |
| T | 947731 | 6.1% |
| F | 880192 | 5.7% |
| M | 690711 | 4.5% |
| X | 582837 | 3.8% |
| O | 549630 | 3.6% |
| I | 487715 | 3.2% |
| Other values (14) | 2895988 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 3151246 | |
| C | 2642709 | |
| N | 1328134 | |
| L | 1299895 | |
| T | 947731 | 6.1% |
| F | 880192 | 5.7% |
| M | 690711 | 4.5% |
| X | 582837 | 3.8% |
| O | 549630 | 3.6% |
| I | 487715 | 3.2% |
| Other values (14) | 2895988 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 3151246 | |
| C | 2642709 | |
| N | 1328134 | |
| L | 1299895 | |
| T | 947731 | 6.1% |
| F | 880192 | 5.7% |
| M | 690711 | 4.5% |
| X | 582837 | 3.8% |
| O | 549630 | 3.6% |
| I | 487715 | 3.2% |
| Other values (14) | 2895988 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 3151246 | |
| C | 2642709 | |
| N | 1328134 | |
| L | 1299895 | |
| T | 947731 | 6.1% |
| F | 880192 | 5.7% |
| M | 690711 | 4.5% |
| X | 582837 | 3.8% |
| O | 549630 | 3.6% |
| I | 487715 | 3.2% |
| Other values (14) | 2895988 |
Zipcode
Text
| Distinct | 825094 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 1915 |
| Missing (%) | < 0.1% |
| Memory size | 59.0 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 5 |
| Mean length | 6.4678536 |
| Min length | 5 |
Characters and Unicode
| Total characters | 49973735 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 450606 ? |
|---|---|
| Unique (%) | 5.8% |
Sample
| 1st row | 45424 |
|---|---|
| 2nd row | 43068-3402 |
| 3rd row | 45176 |
| 4th row | 45417 |
| 5th row | 45459 |
| Value | Count | Frequency (%) |
| 91761 | 11247 | 0.1% |
| 91706 | 10022 | 0.1% |
| 92407 | 8922 | 0.1% |
| 92507 | 8850 | 0.1% |
| 33186 | 8375 | 0.1% |
| 32819 | 7461 | 0.1% |
| 91765 | 7377 | 0.1% |
| 33169 | 7106 | 0.1% |
| 90023 | 7066 | 0.1% |
| 92324 | 7010 | 0.1% |
| Other values (825084) | 7643043 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6303792 | |
| 2 | 6159301 | |
| 3 | 5704889 | |
| 1 | 5682170 | |
| 9 | 4463210 | |
| 7 | 4316166 | |
| 5 | 4183724 | |
| 4 | 4137052 | |
| 6 | 3427226 | |
| 8 | 3321903 | |
| Other values (3) | 2274302 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 49973735 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6303792 | |
| 2 | 6159301 | |
| 3 | 5704889 | |
| 1 | 5682170 | |
| 9 | 4463210 | |
| 7 | 4316166 | |
| 5 | 4183724 | |
| 4 | 4137052 | |
| 6 | 3427226 | |
| 8 | 3321903 | |
| Other values (3) | 2274302 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 49973735 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6303792 | |
| 2 | 6159301 | |
| 3 | 5704889 | |
| 1 | 5682170 | |
| 9 | 4463210 | |
| 7 | 4316166 | |
| 5 | 4183724 | |
| 4 | 4137052 | |
| 6 | 3427226 | |
| 8 | 3321903 | |
| Other values (3) | 2274302 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 49973735 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 6303792 | |
| 2 | 6159301 | |
| 3 | 5704889 | |
| 1 | 5682170 | |
| 9 | 4463210 | |
| 7 | 4316166 | |
| 5 | 4183724 | |
| 4 | 4137052 | |
| 6 | 3427226 | |
| 8 | 3321903 | |
| Other values (3) | 2274302 | 4.6% |
Country
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 59.0 MiB |
| US |
|---|
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 15456788 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
Common Values
| Value | Count | Frequency (%) |
| US | 7728394 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| us | 7728394 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 7728394 | |
| S | 7728394 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| U | 7728394 | |
| S | 7728394 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| U | 7728394 | |
| S | 7728394 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 15456788 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| U | 7728394 | |
| S | 7728394 |
Timezone
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7808 |
| Missing (%) | 0.1% |
| Memory size | 59.0 MiB |
| US/Eastern | |
|---|---|
| US/Pacific | |
| US/Central | |
| US/Mountain |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 10.055931 |
| Min length | 10 |
Characters and Unicode
| Total characters | 77637679 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US/Eastern |
|---|---|
| 2nd row | US/Eastern |
| 3rd row | US/Eastern |
| 4th row | US/Eastern |
| 5th row | US/Eastern |
Common Values
| Value | Count | Frequency (%) |
| US/Eastern | 3580167 | |
| US/Pacific | 2062984 | |
| US/Central | 1645616 | |
| US/Mountain | 431819 | 5.6% |
| (Missing) | 7808 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| us/eastern | 3580167 | |
| us/pacific | 2062984 | |
| us/central | 1645616 | |
| us/mountain | 431819 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 7720586 | |
| a | 7720586 | |
| / | 7720586 | |
| S | 7720586 | |
| n | 6089421 | 7.8% |
| t | 5657602 | 7.3% |
| e | 5225783 | 6.7% |
| r | 5225783 | 6.7% |
| i | 4557787 | 5.9% |
| c | 4125968 | 5.3% |
| Other values (9) | 15872991 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 77637679 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| U | 7720586 | |
| a | 7720586 | |
| / | 7720586 | |
| S | 7720586 | |
| n | 6089421 | 7.8% |
| t | 5657602 | 7.3% |
| e | 5225783 | 6.7% |
| r | 5225783 | 6.7% |
| i | 4557787 | 5.9% |
| c | 4125968 | 5.3% |
| Other values (9) | 15872991 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 77637679 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| U | 7720586 | |
| a | 7720586 | |
| / | 7720586 | |
| S | 7720586 | |
| n | 6089421 | 7.8% |
| t | 5657602 | 7.3% |
| e | 5225783 | 6.7% |
| r | 5225783 | 6.7% |
| i | 4557787 | 5.9% |
| c | 4125968 | 5.3% |
| Other values (9) | 15872991 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 77637679 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| U | 7720586 | |
| a | 7720586 | |
| / | 7720586 | |
| S | 7720586 | |
| n | 6089421 | 7.8% |
| t | 5657602 | 7.3% |
| e | 5225783 | 6.7% |
| r | 5225783 | 6.7% |
| i | 4557787 | 5.9% |
| c | 4125968 | 5.3% |
| Other values (9) | 15872991 |
Airport_Code
Text
| Distinct | 2045 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 22635 |
| Missing (%) | 0.3% |
| Memory size | 59.0 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 30823036 |
|---|---|
| Distinct characters | 36 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | KFFO |
|---|---|
| 2nd row | KCMH |
| 3rd row | KI69 |
| 4th row | KDAY |
| 5th row | KMGY |
| Value | Count | Frequency (%) |
| kcqt | 118332 | 1.5% |
| krdu | 107267 | 1.4% |
| kmcj | 101786 | 1.3% |
| kbna | 98926 | 1.3% |
| kclt | 97273 | 1.3% |
| korl | 82480 | 1.1% |
| kmia | 81358 | 1.1% |
| kbtr | 78304 | 1.0% |
| kopf | 70665 | 0.9% |
| kdal | 69353 | 0.9% |
| Other values (2035) | 6800015 |
Most occurring characters
| Value | Count | Frequency (%) |
| K | 8170466 | |
| A | 1664545 | 5.4% |
| C | 1572716 | 5.1% |
| M | 1370216 | 4.4% |
| T | 1362405 | 4.4% |
| L | 1307926 | 4.2% |
| S | 1296708 | 4.2% |
| D | 1235661 | 4.0% |
| R | 1192091 | 3.9% |
| O | 1119180 | 3.6% |
| Other values (26) | 10531122 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 30823036 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| K | 8170466 | |
| A | 1664545 | 5.4% |
| C | 1572716 | 5.1% |
| M | 1370216 | 4.4% |
| T | 1362405 | 4.4% |
| L | 1307926 | 4.2% |
| S | 1296708 | 4.2% |
| D | 1235661 | 4.0% |
| R | 1192091 | 3.9% |
| O | 1119180 | 3.6% |
| Other values (26) | 10531122 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 30823036 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| K | 8170466 | |
| A | 1664545 | 5.4% |
| C | 1572716 | 5.1% |
| M | 1370216 | 4.4% |
| T | 1362405 | 4.4% |
| L | 1307926 | 4.2% |
| S | 1296708 | 4.2% |
| D | 1235661 | 4.0% |
| R | 1192091 | 3.9% |
| O | 1119180 | 3.6% |
| Other values (26) | 10531122 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 30823036 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| K | 8170466 | |
| A | 1664545 | 5.4% |
| C | 1572716 | 5.1% |
| M | 1370216 | 4.4% |
| T | 1362405 | 4.4% |
| L | 1307926 | 4.2% |
| S | 1296708 | 4.2% |
| D | 1235661 | 4.0% |
| R | 1192091 | 3.9% |
| O | 1119180 | 3.6% |
| Other values (26) | 10531122 |
MISSING 
| Distinct | 941331 |
|---|---|
| Distinct (%) | 12.4% |
| Missing | 120228 |
| Missing (%) | 1.6% |
| Memory size | 59.0 MiB |
| Minimum | 2016-01-14 19:51:00 |
|---|---|
| Maximum | 2023-03-31 23:53:00 |
Temperature(F)
Real number (ℝ)
MISSING 
| Distinct | 860 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 163853 |
| Missing (%) | 2.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 61.663286 |
| Minimum | -89 |
|---|---|
| Maximum | 207 |
| Zeros | 2775 |
| Zeros (%) | < 0.1% |
| Negative | 19478 |
| Negative (%) | 0.3% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | -89 |
|---|---|
| 5-th percentile | 28 |
| Q1 | 49 |
| median | 64 |
| Q3 | 76 |
| 95-th percentile | 89 |
| Maximum | 207 |
| Range | 296 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 19.013653 |
|---|---|
| Coefficient of variation (CV) | 0.30834642 |
| Kurtosis | -0.0012691043 |
| Mean | 61.663286 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.513733 |
| Sum | 4.6645445 × 108 |
| Variance | 361.51901 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 77 | 170991 | 2.2% |
| 73 | 170898 | 2.2% |
| 68 | 163767 | 2.1% |
| 72 | 160498 | 2.1% |
| 75 | 158448 | 2.1% |
| 70 | 155568 | 2.0% |
| 63 | 149787 | 1.9% |
| 59 | 149017 | 1.9% |
| 64 | 148466 | 1.9% |
| 79 | 147140 | 1.9% |
| Other values (850) | 5989961 | |
| (Missing) | 163853 | 2.1% |
| Value | Count | Frequency (%) |
| -89 | 10 | |
| -77.8 | 11 | |
| -58 | 1 | < 0.1% |
| -50 | 1 | < 0.1% |
| -45 | 1 | < 0.1% |
| -44 | 1 | < 0.1% |
| -40 | 2 | < 0.1% |
| -38 | 3 | < 0.1% |
| -37 | 5 | |
| -36 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 207 | 3 | |
| 203 | 1 | < 0.1% |
| 196 | 5 | |
| 189 | 1 | < 0.1% |
| 174 | 2 | < 0.1% |
| 172 | 2 | < 0.1% |
| 170.6 | 1 | < 0.1% |
| 168.8 | 1 | < 0.1% |
| 167 | 1 | < 0.1% |
| 162 | 2 | < 0.1% |
Wind_Chill(F)
Real number (ℝ)
MISSING 
| Distinct | 1001 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1999019 |
| Missing (%) | 25.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 58.251048 |
| Minimum | -89 |
|---|---|
| Maximum | 207 |
| Zeros | 3948 |
| Zeros (%) | 0.1% |
| Negative | 64428 |
| Negative (%) | 0.8% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | -89 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 43 |
| median | 62 |
| Q3 | 75 |
| 95-th percentile | 88 |
| Maximum | 207 |
| Range | 296 |
| Interquartile range (IQR) | 32 |
Descriptive statistics
| Standard deviation | 22.389832 |
|---|---|
| Coefficient of variation (CV) | 0.38436788 |
| Kurtosis | 0.15453367 |
| Mean | 58.251048 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | -0.67278577 |
| Sum | 3.337421 × 108 |
| Variance | 501.30457 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 73 | 133584 | 1.7% |
| 72 | 125378 | 1.6% |
| 75 | 123065 | 1.6% |
| 77 | 122062 | 1.6% |
| 70 | 120727 | 1.6% |
| 63 | 115954 | 1.5% |
| 79 | 115703 | 1.5% |
| 68 | 115211 | 1.5% |
| 64 | 114934 | 1.5% |
| 66 | 112160 | 1.5% |
| Other values (991) | 4530597 | |
| (Missing) | 1999019 |
| Value | Count | Frequency (%) |
| -89 | 10 | |
| -80 | 1 | < 0.1% |
| -69 | 1 | < 0.1% |
| -65.9 | 1 | < 0.1% |
| -63 | 7 | |
| -59 | 4 | < 0.1% |
| -58 | 2 | < 0.1% |
| -55.1 | 1 | < 0.1% |
| -55 | 2 | < 0.1% |
| -54.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 207 | 3 | < 0.1% |
| 196 | 5 | |
| 189 | 1 | < 0.1% |
| 174 | 2 | < 0.1% |
| 172 | 2 | < 0.1% |
| 162 | 2 | < 0.1% |
| 140 | 12 | |
| 138 | 1 | < 0.1% |
| 136 | 3 | < 0.1% |
| 128 | 2 | < 0.1% |
Humidity(%)
Real number (ℝ)
MISSING 
| Distinct | 100 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 174144 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.831041 |
| Minimum | 1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 48 |
| median | 67 |
| Q3 | 84 |
| 95-th percentile | 97 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 22.820968 |
|---|---|
| Coefficient of variation (CV) | 0.3520068 |
| Kurtosis | -0.7234553 |
| Mean | 64.831041 |
| Median Absolute Deviation (MAD) | 18 |
| Skewness | -0.39484242 |
| Sum | 4.897499 × 108 |
| Variance | 520.79656 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 93 | 290345 | 3.8% |
| 100 | 286680 | 3.7% |
| 87 | 169582 | 2.2% |
| 90 | 166492 | 2.2% |
| 89 | 140593 | 1.8% |
| 96 | 134809 | 1.7% |
| 84 | 126652 | 1.6% |
| 81 | 126612 | 1.6% |
| 82 | 124386 | 1.6% |
| 86 | 121255 | 1.6% |
| Other values (90) | 5866844 | |
| (Missing) | 174144 | 2.3% |
| Value | Count | Frequency (%) |
| 1 | 49 | < 0.1% |
| 2 | 189 | < 0.1% |
| 3 | 670 | < 0.1% |
| 4 | 2167 | < 0.1% |
| 5 | 4113 | 0.1% |
| 6 | 6010 | |
| 7 | 8072 | |
| 8 | 9661 | |
| 9 | 11116 | |
| 10 | 13495 |
| Value | Count | Frequency (%) |
| 100 | 286680 | |
| 99 | 14262 | 0.2% |
| 98 | 6977 | 0.1% |
| 97 | 88156 | 1.1% |
| 96 | 134809 | |
| 95 | 9612 | 0.1% |
| 94 | 119009 | |
| 93 | 290345 | |
| 92 | 66899 | 0.9% |
| 91 | 37561 | 0.5% |
Pressure(in)
Real number (ℝ)
MISSING 
| Distinct | 1144 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 140679 |
| Missing (%) | 1.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.538986 |
| Minimum | 0 |
|---|---|
| Maximum | 58.63 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 27.98 |
| Q1 | 29.37 |
| median | 29.86 |
| Q3 | 30.03 |
| 95-th percentile | 30.26 |
| Maximum | 58.63 |
| Range | 58.63 |
| Interquartile range (IQR) | 0.66 |
Descriptive statistics
| Standard deviation | 1.0061898 |
|---|---|
| Coefficient of variation (CV) | 0.034063113 |
| Kurtosis | 21.841661 |
| Mean | 29.538986 |
| Median Absolute Deviation (MAD) | 0.24 |
| Skewness | -3.6387719 |
| Sum | 2.241334 × 108 |
| Variance | 1.0124179 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29.96 | 123289 | 1.6% |
| 29.99 | 121836 | 1.6% |
| 30.01 | 119735 | 1.5% |
| 29.94 | 119000 | 1.5% |
| 30.04 | 113905 | 1.5% |
| 29.97 | 112320 | 1.5% |
| 30.03 | 110898 | 1.4% |
| 29.91 | 110700 | 1.4% |
| 30 | 109999 | 1.4% |
| 29.95 | 109924 | 1.4% |
| Other values (1134) | 6436109 | |
| (Missing) | 140679 | 1.8% |
| Value | Count | Frequency (%) |
| 0 | 3 | < 0.1% |
| 0.02 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.29 | 2 | < 0.1% |
| 0.3 | 6 | |
| 0.39 | 1 | < 0.1% |
| 2.98 | 1 | < 0.1% |
| 2.99 | 9 | |
| 3 | 2 | < 0.1% |
| 3.01 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 58.63 | 9 | |
| 58.39 | 2 | < 0.1% |
| 58.32 | 1 | < 0.1% |
| 58.13 | 1 | < 0.1% |
| 58.1 | 4 | |
| 58.04 | 3 | < 0.1% |
| 58.03 | 1 | < 0.1% |
| 57.74 | 1 | < 0.1% |
| 57.54 | 2 | < 0.1% |
| 56.54 | 2 | < 0.1% |
Visibility(mi)
Real number (ℝ)
MISSING 
| Distinct | 92 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 177098 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0903764 |
| Minimum | 0 |
|---|---|
| Maximum | 140 |
| Zeros | 7679 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 10 |
| median | 10 |
| Q3 | 10 |
| 95-th percentile | 10 |
| Maximum | 140 |
| Range | 140 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.6883159 |
|---|---|
| Coefficient of variation (CV) | 0.29573208 |
| Kurtosis | 81.893919 |
| Mean | 9.0903764 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.3166628 |
| Sum | 68644123 |
| Variance | 7.2270425 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 6070231 | |
| 7 | 217027 | 2.8% |
| 9 | 188529 | 2.4% |
| 8 | 149975 | 1.9% |
| 5 | 144153 | 1.9% |
| 6 | 126586 | 1.6% |
| 2 | 121785 | 1.6% |
| 4 | 119770 | 1.5% |
| 3 | 117493 | 1.5% |
| 1 | 102557 | 1.3% |
| Other values (82) | 193190 | 2.5% |
| (Missing) | 177098 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 7679 | 0.1% |
| 0.06 | 323 | < 0.1% |
| 0.1 | 1287 | < 0.1% |
| 0.12 | 1775 | < 0.1% |
| 0.19 | 41 | < 0.1% |
| 0.2 | 12105 | |
| 0.25 | 27344 | |
| 0.31 | 4 | < 0.1% |
| 0.38 | 337 | < 0.1% |
| 0.4 | 98 | < 0.1% |
| Value | Count | Frequency (%) |
| 140 | 3 | < 0.1% |
| 130 | 1 | < 0.1% |
| 120 | 5 | < 0.1% |
| 111 | 3 | < 0.1% |
| 110 | 1 | < 0.1% |
| 105 | 1 | < 0.1% |
| 101 | 1 | < 0.1% |
| 100 | 47 | |
| 98 | 1 | < 0.1% |
| 90 | 13 | < 0.1% |
Wind_Direction
Categorical
MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 175206 |
| Missing (%) | 2.3% |
| Memory size | 59.0 MiB |
| CALM | |
|---|---|
| S | 419989 |
| SSW | 384840 |
| W | 383913 |
| WNW | 378781 |
| Other values (19) |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 2.8361859 |
| Min length | 1 |
Characters and Unicode
| Total characters | 21422245 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Calm |
|---|---|
| 2nd row | Calm |
| 3rd row | SW |
| 4th row | SW |
| 5th row | SW |
Common Values
| Value | Count | Frequency (%) |
| CALM | 961624 | 12.4% |
| S | 419989 | 5.4% |
| SSW | 384840 | 5.0% |
| W | 383913 | 5.0% |
| WNW | 378781 | 4.9% |
| NW | 369352 | 4.8% |
| Calm | 368557 | 4.8% |
| SW | 364470 | 4.7% |
| WSW | 353806 | 4.6% |
| SSE | 349110 | 4.5% |
| Other values (14) | 3218746 |
Length
| Value | Count | Frequency (%) |
| calm | 1330181 | |
| s | 419989 | 5.6% |
| ssw | 384840 | 5.1% |
| w | 383913 | 5.1% |
| wnw | 378781 | 5.0% |
| nw | 369352 | 4.9% |
| sw | 364470 | 4.8% |
| wsw | 353806 | 4.7% |
| sse | 349110 | 4.6% |
| nnw | 333427 | 4.4% |
| Other values (13) | 2885319 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 3465927 | |
| S | 3346752 | |
| N | 2903258 | |
| E | 2593990 | |
| C | 1330181 | 6.2% |
| A | 1212190 | 5.7% |
| L | 961624 | 4.5% |
| M | 961624 | 4.5% |
| a | 700094 | 3.3% |
| t | 599056 | 2.8% |
| Other values (12) | 3347549 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 21422245 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| W | 3465927 | |
| S | 3346752 | |
| N | 2903258 | |
| E | 2593990 | |
| C | 1330181 | 6.2% |
| A | 1212190 | 5.7% |
| L | 961624 | 4.5% |
| M | 961624 | 4.5% |
| a | 700094 | 3.3% |
| t | 599056 | 2.8% |
| Other values (12) | 3347549 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 21422245 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| W | 3465927 | |
| S | 3346752 | |
| N | 2903258 | |
| E | 2593990 | |
| C | 1330181 | 6.2% |
| A | 1212190 | 5.7% |
| L | 961624 | 4.5% |
| M | 961624 | 4.5% |
| a | 700094 | 3.3% |
| t | 599056 | 2.8% |
| Other values (12) | 3347549 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 21422245 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| W | 3465927 | |
| S | 3346752 | |
| N | 2903258 | |
| E | 2593990 | |
| C | 1330181 | 6.2% |
| A | 1212190 | 5.7% |
| L | 961624 | 4.5% |
| M | 961624 | 4.5% |
| a | 700094 | 3.3% |
| t | 599056 | 2.8% |
| Other values (12) | 3347549 |
Wind_Speed(mph)
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 184 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 571233 |
| Missing (%) | 7.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.6854896 |
| Minimum | 0 |
|---|---|
| Maximum | 1087 |
| Zeros | 961643 |
| Zeros (%) | 12.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 4.6 |
| median | 7 |
| Q3 | 10.4 |
| 95-th percentile | 17 |
| Maximum | 1087 |
| Range | 1087 |
| Interquartile range (IQR) | 5.8 |
Descriptive statistics
| Standard deviation | 5.4249834 |
|---|---|
| Coefficient of variation (CV) | 0.7058735 |
| Kurtosis | 1085.4752 |
| Mean | 7.6854896 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 8.0494761 |
| Sum | 55006286 |
| Variance | 29.430445 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 961643 | 12.4% |
| 5 | 534875 | 6.9% |
| 6 | 517199 | 6.7% |
| 3 | 514123 | 6.7% |
| 7 | 480904 | 6.2% |
| 8 | 432522 | 5.6% |
| 9 | 389161 | 5.0% |
| 10 | 324080 | 4.2% |
| 12 | 280269 | 3.6% |
| 4.6 | 217615 | 2.8% |
| Other values (174) | 2504770 | |
| (Missing) | 571233 | 7.4% |
| Value | Count | Frequency (%) |
| 0 | 961643 | |
| 1 | 195 | < 0.1% |
| 1.2 | 445 | < 0.1% |
| 2 | 451 | < 0.1% |
| 2.3 | 906 | < 0.1% |
| 3 | 514123 | |
| 3.5 | 203579 | 2.6% |
| 4.6 | 217615 | 2.8% |
| 5 | 534875 | |
| 5.8 | 216150 | 2.8% |
| Value | Count | Frequency (%) |
| 1087 | 1 | < 0.1% |
| 984 | 1 | < 0.1% |
| 822.8 | 7 | |
| 812 | 1 | < 0.1% |
| 703.1 | 2 | < 0.1% |
| 580 | 2 | < 0.1% |
| 518 | 2 | < 0.1% |
| 471.8 | 1 | < 0.1% |
| 328 | 1 | < 0.1% |
| 255 | 1 | < 0.1% |
Precipitation(in)
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 299 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2203586 |
| Missing (%) | 28.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0084072098 |
| Minimum | 0 |
|---|---|
| Maximum | 36.47 |
| Zeros | 4991718 |
| Zeros (%) | 64.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 59.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.03 |
| Maximum | 36.47 |
| Range | 36.47 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.11022465 |
|---|---|
| Coefficient of variation (CV) | 13.110729 |
| Kurtosis | 10710.92 |
| Mean | 0.0084072098 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 85.99914 |
| Sum | 46448.22 |
| Variance | 0.012149473 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4991718 | |
| 0.01 | 151010 | 2.0% |
| 0.02 | 74008 | 1.0% |
| 0.03 | 50055 | 0.6% |
| 0.04 | 37300 | 0.5% |
| 0.05 | 29167 | 0.4% |
| 0.06 | 23958 | 0.3% |
| 0.07 | 19353 | 0.3% |
| 0.08 | 16138 | 0.2% |
| 0.09 | 14307 | 0.2% |
| Other values (289) | 117794 | 1.5% |
| (Missing) | 2203586 |
| Value | Count | Frequency (%) |
| 0 | 4991718 | |
| 0.01 | 151010 | 2.0% |
| 0.02 | 74008 | 1.0% |
| 0.03 | 50055 | 0.6% |
| 0.04 | 37300 | 0.5% |
| 0.05 | 29167 | 0.4% |
| 0.06 | 23958 | 0.3% |
| 0.07 | 19353 | 0.3% |
| 0.08 | 16138 | 0.2% |
| 0.09 | 14307 | 0.2% |
| Value | Count | Frequency (%) |
| 36.47 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 4 | |
| 23.97 | 1 | < 0.1% |
| 10.8 | 1 | < 0.1% |
| 10.4 | 2 | |
| 10.18 | 1 | < 0.1% |
| 10.16 | 1 | < 0.1% |
| 10.14 | 2 | |
| 10.13 | 2 |
MISSING 
| Distinct | 144 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 173459 |
| Missing (%) | 2.2% |
| Memory size | 59.0 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 7.6729617 |
| Min length | 3 |
Characters and Unicode
| Total characters | 57968727 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Light Rain |
|---|---|
| 2nd row | Light Rain |
| 3rd row | Overcast |
| 4th row | Mostly Cloudy |
| 5th row | Mostly Cloudy |
| Value | Count | Frequency (%) |
| fair | 2596473 | |
| cloudy | 2576033 | |
| mostly | 1032703 | 9.9% |
| clear | 808743 | 7.7% |
| partly | 709213 | 6.8% |
| light | 545023 | 5.2% |
| rain | 509071 | 4.9% |
| overcast | 382866 | 3.7% |
| scattered | 204829 | 2.0% |
| clouds | 204829 | 2.0% |
| Other values (51) | 888209 | 8.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 5372144 | 9.3% |
| a | 5364570 | 9.3% |
| r | 4858585 | 8.4% |
| y | 4510118 | 7.8% |
| o | 4153193 | 7.2% |
| i | 3924138 | 6.8% |
| C | 3589627 | 6.2% |
| t | 3202474 | 5.5% |
| d | 3164721 | 5.5% |
| 2903057 | 5.0% | |
| Other values (36) | 16926100 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 57968727 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| l | 5372144 | 9.3% |
| a | 5364570 | 9.3% |
| r | 4858585 | 8.4% |
| y | 4510118 | 7.8% |
| o | 4153193 | 7.2% |
| i | 3924138 | 6.8% |
| C | 3589627 | 6.2% |
| t | 3202474 | 5.5% |
| d | 3164721 | 5.5% |
| 2903057 | 5.0% | |
| Other values (36) | 16926100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 57968727 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| l | 5372144 | 9.3% |
| a | 5364570 | 9.3% |
| r | 4858585 | 8.4% |
| y | 4510118 | 7.8% |
| o | 4153193 | 7.2% |
| i | 3924138 | 6.8% |
| C | 3589627 | 6.2% |
| t | 3202474 | 5.5% |
| d | 3164721 | 5.5% |
| 2903057 | 5.0% | |
| Other values (36) | 16926100 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 57968727 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| l | 5372144 | 9.3% |
| a | 5364570 | 9.3% |
| r | 4858585 | 8.4% |
| y | 4510118 | 7.8% |
| o | 4153193 | 7.2% |
| i | 3924138 | 6.8% |
| C | 3589627 | 6.2% |
| t | 3202474 | 5.5% |
| d | 3164721 | 5.5% |
| 2903057 | 5.0% | |
| Other values (36) | 16926100 |
Amenity
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 96334 |
| Value | Count | Frequency (%) |
| False | 7632060 | |
| True | 96334 | 1.2% |
Bump
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 3514 |
| Value | Count | Frequency (%) |
| False | 7724880 | |
| True | 3514 | < 0.1% |
Crossing
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 6854631 | |
| True | 873763 | 11.3% |
Give_Way
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 36582 |
| Value | Count | Frequency (%) |
| False | 7691812 | |
| True | 36582 | 0.5% |
Junction
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 571342 |
| Value | Count | Frequency (%) |
| False | 7157052 | |
| True | 571342 | 7.4% |
No_Exit
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 19545 |
| Value | Count | Frequency (%) |
| False | 7708849 | |
| True | 19545 | 0.3% |
Railway
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 66979 |
| Value | Count | Frequency (%) |
| False | 7661415 | |
| True | 66979 | 0.9% |
Roundabout
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 249 |
| Value | Count | Frequency (%) |
| False | 7728145 | |
| True | 249 | < 0.1% |
Station
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 201901 |
| Value | Count | Frequency (%) |
| False | 7526493 | |
| True | 201901 | 2.6% |
Stop
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 214371 |
| Value | Count | Frequency (%) |
| False | 7514023 | |
| True | 214371 | 2.8% |
Traffic_Calming
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True | 7598 |
| Value | Count | Frequency (%) |
| False | 7720796 | |
| True | 7598 | 0.1% |
Traffic_Signal
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 6584622 | |
| True | 1143772 | 14.8% |
Turning_Loop
Boolean
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.4 MiB |
| False |
|---|
| Value | Count | Frequency (%) |
| False | 7728394 |
Sunrise_Sunset
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23246 |
| Missing (%) | 0.3% |
| Memory size | 59.0 MiB |
| Day | |
|---|---|
| Night |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.6153276 |
| Min length | 3 |
Characters and Unicode
| Total characters | 27856634 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Night |
| 3rd row | Night |
| 4th row | Night |
| 5th row | Day |
Common Values
| Value | Count | Frequency (%) |
| Day | 5334553 | |
| Night | 2370595 | |
| (Missing) | 23246 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| day | 5334553 | |
| night | 2370595 |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 5334553 | |
| a | 5334553 | |
| y | 5334553 | |
| N | 2370595 | |
| i | 2370595 | |
| g | 2370595 | |
| h | 2370595 | |
| t | 2370595 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27856634 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 5334553 | |
| a | 5334553 | |
| y | 5334553 | |
| N | 2370595 | |
| i | 2370595 | |
| g | 2370595 | |
| h | 2370595 | |
| t | 2370595 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 27856634 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 5334553 | |
| a | 5334553 | |
| y | 5334553 | |
| N | 2370595 | |
| i | 2370595 | |
| g | 2370595 | |
| h | 2370595 | |
| t | 2370595 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 27856634 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 5334553 | |
| a | 5334553 | |
| y | 5334553 | |
| N | 2370595 | |
| i | 2370595 | |
| g | 2370595 | |
| h | 2370595 | |
| t | 2370595 |
Civil_Twilight
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23246 |
| Missing (%) | 0.3% |
| Memory size | 59.0 MiB |
| Day | |
|---|---|
| Night |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.5216069 |
| Min length | 3 |
Characters and Unicode
| Total characters | 27134502 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Night |
| 3rd row | Night |
| 4th row | Day |
| 5th row | Day |
Common Values
| Value | Count | Frequency (%) |
| Day | 5695619 | |
| Night | 2009529 | 26.0% |
| (Missing) | 23246 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| day | 5695619 | |
| night | 2009529 | 26.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 5695619 | |
| a | 5695619 | |
| y | 5695619 | |
| N | 2009529 | 7.4% |
| i | 2009529 | 7.4% |
| g | 2009529 | 7.4% |
| h | 2009529 | 7.4% |
| t | 2009529 | 7.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27134502 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 5695619 | |
| a | 5695619 | |
| y | 5695619 | |
| N | 2009529 | 7.4% |
| i | 2009529 | 7.4% |
| g | 2009529 | 7.4% |
| h | 2009529 | 7.4% |
| t | 2009529 | 7.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 27134502 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 5695619 | |
| a | 5695619 | |
| y | 5695619 | |
| N | 2009529 | 7.4% |
| i | 2009529 | 7.4% |
| g | 2009529 | 7.4% |
| h | 2009529 | 7.4% |
| t | 2009529 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 27134502 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 5695619 | |
| a | 5695619 | |
| y | 5695619 | |
| N | 2009529 | 7.4% |
| i | 2009529 | 7.4% |
| g | 2009529 | 7.4% |
| h | 2009529 | 7.4% |
| t | 2009529 | 7.4% |
Nautical_Twilight
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23246 |
| Missing (%) | 0.3% |
| Memory size | 59.0 MiB |
| Day | |
|---|---|
| Night |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.4228321 |
| Min length | 3 |
Characters and Unicode
| Total characters | 26373428 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Night |
| 3rd row | Day |
| 4th row | Day |
| 5th row | Day |
Common Values
| Value | Count | Frequency (%) |
| Day | 6076156 | |
| Night | 1628992 | 21.1% |
| (Missing) | 23246 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| day | 6076156 | |
| night | 1628992 | 21.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 6076156 | |
| a | 6076156 | |
| y | 6076156 | |
| N | 1628992 | 6.2% |
| i | 1628992 | 6.2% |
| g | 1628992 | 6.2% |
| h | 1628992 | 6.2% |
| t | 1628992 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 26373428 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 6076156 | |
| a | 6076156 | |
| y | 6076156 | |
| N | 1628992 | 6.2% |
| i | 1628992 | 6.2% |
| g | 1628992 | 6.2% |
| h | 1628992 | 6.2% |
| t | 1628992 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 26373428 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 6076156 | |
| a | 6076156 | |
| y | 6076156 | |
| N | 1628992 | 6.2% |
| i | 1628992 | 6.2% |
| g | 1628992 | 6.2% |
| h | 1628992 | 6.2% |
| t | 1628992 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 26373428 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 6076156 | |
| a | 6076156 | |
| y | 6076156 | |
| N | 1628992 | 6.2% |
| i | 1628992 | 6.2% |
| g | 1628992 | 6.2% |
| h | 1628992 | 6.2% |
| t | 1628992 | 6.2% |
Astronomical_Twilight
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 23246 |
| Missing (%) | 0.3% |
| Memory size | 59.0 MiB |
| Day | |
|---|---|
| Night |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.3446008 |
| Min length | 3 |
Characters and Unicode
| Total characters | 25770644 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Night |
|---|---|
| 2nd row | Day |
| 3rd row | Day |
| 4th row | Day |
| 5th row | Day |
Common Values
| Value | Count | Frequency (%) |
| Day | 6377548 | |
| Night | 1327600 | 17.2% |
| (Missing) | 23246 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| day | 6377548 | |
| night | 1327600 | 17.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| D | 6377548 | |
| a | 6377548 | |
| y | 6377548 | |
| N | 1327600 | 5.2% |
| i | 1327600 | 5.2% |
| g | 1327600 | 5.2% |
| h | 1327600 | 5.2% |
| t | 1327600 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25770644 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| D | 6377548 | |
| a | 6377548 | |
| y | 6377548 | |
| N | 1327600 | 5.2% |
| i | 1327600 | 5.2% |
| g | 1327600 | 5.2% |
| h | 1327600 | 5.2% |
| t | 1327600 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25770644 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| D | 6377548 | |
| a | 6377548 | |
| y | 6377548 | |
| N | 1327600 | 5.2% |
| i | 1327600 | 5.2% |
| g | 1327600 | 5.2% |
| h | 1327600 | 5.2% |
| t | 1327600 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25770644 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| D | 6377548 | |
| a | 6377548 | |
| y | 6377548 | |
| N | 1327600 | 5.2% |
| i | 1327600 | 5.2% |
| g | 1327600 | 5.2% |
| h | 1327600 | 5.2% |
| t | 1327600 | 5.2% |